Picture for Taylor W. Killian

Taylor W. Killian

LARK: Learnability-Grounded Trajectory Selection for Efficient Reasoning Distillation

Add code
May 28, 2026
Viaarxiv icon

Efficient Agentic Reasoning Through Self-Regulated Simulative Planning

Add code
May 21, 2026
Viaarxiv icon

Improving and Accelerating Offline RL in Large Discrete Action Spaces with Structured Policy Initialization

Add code
Jan 07, 2026
Viaarxiv icon

K2-Think: A Parameter-Efficient Reasoning System

Add code
Sep 09, 2025
Viaarxiv icon

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective

Add code
Jun 17, 2025
Figure 1 for Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective
Figure 2 for Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective
Figure 3 for Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective
Figure 4 for Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective
Viaarxiv icon

SAINT: Attention-Based Modeling of Sub-Action Dependencies in Multi-Action Policies

Add code
May 17, 2025
Viaarxiv icon

Continuous Time Evidential Distributions for Irregular Time Series

Add code
Jul 25, 2023
Figure 1 for Continuous Time Evidential Distributions for Irregular Time Series
Figure 2 for Continuous Time Evidential Distributions for Irregular Time Series
Figure 3 for Continuous Time Evidential Distributions for Irregular Time Series
Figure 4 for Continuous Time Evidential Distributions for Irregular Time Series
Viaarxiv icon

Risk Sensitive Dead-end Identification in Safety-Critical Offline Reinforcement Learning

Add code
Jan 13, 2023
Figure 1 for Risk Sensitive Dead-end Identification in Safety-Critical Offline Reinforcement Learning
Figure 2 for Risk Sensitive Dead-end Identification in Safety-Critical Offline Reinforcement Learning
Figure 3 for Risk Sensitive Dead-end Identification in Safety-Critical Offline Reinforcement Learning
Figure 4 for Risk Sensitive Dead-end Identification in Safety-Critical Offline Reinforcement Learning
Viaarxiv icon

Medical Dead-ends and Learning to Identify High-risk States and Treatments

Add code
Oct 08, 2021
Figure 1 for Medical Dead-ends and Learning to Identify High-risk States and Treatments
Figure 2 for Medical Dead-ends and Learning to Identify High-risk States and Treatments
Figure 3 for Medical Dead-ends and Learning to Identify High-risk States and Treatments
Figure 4 for Medical Dead-ends and Learning to Identify High-risk States and Treatments
Viaarxiv icon

An Empirical Study of Representation Learning for Reinforcement Learning in Healthcare

Add code
Nov 23, 2020
Figure 1 for An Empirical Study of Representation Learning for Reinforcement Learning in Healthcare
Figure 2 for An Empirical Study of Representation Learning for Reinforcement Learning in Healthcare
Figure 3 for An Empirical Study of Representation Learning for Reinforcement Learning in Healthcare
Figure 4 for An Empirical Study of Representation Learning for Reinforcement Learning in Healthcare
Viaarxiv icon